Abstract

We show that if $\Sigma = (V, E)$ is a regular bipartite graph for which the expansion of subsets of a single parity of $V$ is reasonably good and which satisfies a certain local condition (that the union of the neighbourhoods of adjacent vertices does not contain too many pairwise non-adjacent vertices), and if $\mathcal{M}$ is a Markov chain on the set of proper 3-colourings of $\Sigma$ which updates the colour of at most $\rho|V|$ vertices at each step and whose stationary distribution is uniform, then for $\rho \approx .22$ and $d$ sufficiently large the convergence to stationarity of $\mathcal{M}$ is (essentially) exponential in $|V|$. In particular, if $\Sigma$ is the $d$-dimensional hypercube $Q_d$ (the graph on vertex set $\{0,1\}^d$ in which two strings are adjacent if they differ on exactly one coordinate) then the convergence to stationarity of the well-known Glauber (single-site update) dynamics is exponentially slow in $2^d/(\sqrt{d}\log d)$. A combinatorial corollary of our main result is that in a uniform 3-colouring of $Q_d$ there is an exponentially small probability (in $2^d$) that there is a colour $i$ such that the proportion of vertices of the even subcube coloured $i$ differs from the proportion of the odd subcube coloured $i$ by at most .22. Our proof combines a conductance argument with combinatorial enumeration methods.
1 Introduction and statement of the result



Markov chain Monte Carlo algorithms (MCMC's) occur frequently in computer sci- 
ence in algorithms designed to sample from or estimate the size of large combina- 
torially defined structures; they are also used in statistical physics and the study of 
networks to help understand the behavior of models of physical systems and networks 
in equilibrium. In this paper we study a class of natural MCMC's that sample from 
proper 3-colourings of a regular bipartite graph. 

Let $\Sigma = (V, E)$ be a simple, loopless, finite graph on vertex set $V$ and edge set $E$. (For graph theory basics, see e.g. [1], [9].) For a positive integer $q$ write $\mathcal{C}_q = \mathcal{C}_q(\Sigma)$ for the set of proper $q$-colourings of $\Sigma$; that is,

$$\mathcal{C}_q = \{\chi : V(\Sigma) \to \{0, 1, \ldots, q-1\} : xy \in E(\Sigma) \Rightarrow \chi(x) \neq \chi(y)\}.$$

Let $\pi_q = \pi_q(\Sigma)$ be the uniform probability distribution on $\mathcal{C}_q$.

The notion of $q$-colouring is fundamental in graph theory; see e.g. [3, Chapter 5] for a survey. The notion also occurs in statistical physics; the pair $(\mathcal{C}_q, \pi_q)$ is the zero-temperature limit of the $q$-state antiferromagnetic Potts model (see e.g. [27, 28]).

Glauber dynamics for proper $q$-colourings is the single-site update Markov chain $\mathcal{M}_q = \mathcal{M}_q(\Sigma)$ on state space $\mathcal{C}_q$ with transition probabilities $P_q(\chi_1, \chi_2)$, $\chi_1, \chi_2 \in \mathcal{C}_q$, given by

$$P_q(\chi_1, \chi_2) = \begin{cases} 0 & \text{if } |\{v \in V : \chi_1(v) \neq \chi_2(v)\}| > 1 \\ \frac{1}{q|V|} & \text{if } |\{v \in V : \chi_1(v) \neq \chi_2(v)\}| = 1 \\ 1 - \sum_{\chi_2' \neq \chi_1} P_q(\chi_1, \chi_2') & \text{if } \chi_1 = \chi_2. \end{cases}$$

We may think of $\mathcal{M}_q$ dynamically as follows. From a $q$-colouring $\chi$ choose a vertex $v$ uniformly from $V$ and a colour $j$ uniformly from $\{0, \ldots, q-1\}$. Then define a function $\chi' : V \to \{0, \ldots, q-1\}$ by

$$\chi'(w) = \begin{cases} \chi(w) & \text{if } w \neq v \\ j & \text{if } w = v. \end{cases}$$

Finally, move to $\chi'$ if $\chi'$ is a proper $q$-colouring, and stay at $\chi$ otherwise. (A variant of Glauber dynamics chooses $j$ uniformly from $\{0, \ldots, q-1\} \setminus \{\chi(w) : w \sim v\}$, ensuring that $\chi'$ is always a proper colouring. This changes the transition probabilities, but does not significantly change the qualitative behavior of the chain.)

For all $\Sigma$ the chain $\mathcal{M}_q$ is aperiodic, but it is not in general irreducible (consider, for example, $\Sigma = K_q$, the complete graph on $q$ vertices), and so not ergodic. In the case when $\mathcal{M}_q$ is ergodic (e.g., when $\Sigma$ has maximum degree $\Delta$ and $q \geq \Delta + 2$; see [TB]) it is readily checked that it has (unique) stationary distribution $\pi_q$. (One only has to check that $\mathcal{M}_q$ is reversible with respect to $\pi_q$; that is, that it satisfies the detailed balance equations $\pi_q(\chi_1)P_q(\chi_1, \chi_2) = \pi_q(\chi_2)P_q(\chi_2, \chi_1)$ for all $\chi_1, \chi_2 \in \mathcal{C}_q$.)
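The dynamical description of Glauber dynamics can be made concrete with a short simulation. The following is only a minimal sketch: the helper names `hypercube`, `is_proper` and `glauber_step` are illustrative choices made here (they do not come from the paper), and the hypercube is used merely as a convenient regular bipartite test graph.

```python
import random


def hypercube(d):
    """Adjacency lists of Q_d; vertex v is an integer read as a d-bit string."""
    return {v: [v ^ (1 << i) for i in range(d)] for v in range(1 << d)}


def is_proper(colouring, adj):
    """True if no edge has both endpoints of the same colour."""
    return all(colouring[u] != colouring[w] for u in adj for w in adj[u])


def glauber_step(colouring, adj, q, rng):
    """One step of single-site Glauber dynamics on proper q-colourings.

    Pick a uniform vertex v and a uniform colour j, recolour v with j,
    and accept the move only if the result is still a proper colouring.
    """
    v = rng.choice(sorted(adj))
    j = rng.randrange(q)
    if all(colouring[w] != j for w in adj[v]):
        updated = dict(colouring)
        updated[v] = j
        return updated
    return colouring  # rejected move: stay at the current colouring


if __name__ == "__main__":
    rng = random.Random(0)
    adj = hypercube(3)
    # Start from the parity 2-colouring, which is a proper 3-colouring.
    chi = {v: bin(v).count("1") % 2 for v in adj}
    for _ in range(1000):
        chi = glauber_step(chi, adj, 3, rng)
    print(is_proper(chi, adj), chi)
```

Each call changes the colour of at most one vertex, which is the property that later makes the chain $\rho$-local.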
A natural and important question to ask about $\mathcal{M}_q$ in this case is how quickly it converges to its stationary distribution. We define the mixing time $\tau_{\mathcal{M}_q}$ of $\mathcal{M}_q$ by

r Mq = min jt : d TV (Pg,n q ) < i j 

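where $P_q^t(\chi, \chi')$ is the probability of moving from $\chi$ to $\chi'$ in $t$ steps and

$$d_{TV}(P_q^t, \pi_q) = \max_{\chi_1 \in \mathcal{C}_q} \frac{1}{2} \sum_{\chi_2 \in \mathcal{C}_q} |P_q^t(\chi_1, \chi_2) - \pi_q(\chi_2)|$$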

is total variation distance. The mixing time of $\mathcal{M}_q$ captures the speed at which the chain converges to its stationary distribution: for every $\varepsilon > 0$, in order to get a sample from $\mathcal{C}_q$ which is within $\varepsilon$ of $\pi_q$ (in total variation distance), it is necessary and sufficient to run the chain from some arbitrarily chosen distribution for some multiple (depending on $\varepsilon$) of the mixing time. For surveys of issues related to the
mixing time of a Markov chain, see e.g. [U EU [22] . 

Jerrum [15] and Salas and Sokal [21] independently showed that if $\Sigma$ has maximum degree $\Delta$ and $q > 2\Delta$ then there is rapid mixing of the Glauber dynamics; i.e., $\tau_{\mathcal{M}_q}(\Sigma)$ is polynomial in $|V|$. In fact, they showed that the mixing time is optimal ($O(|V| \log |V|)$). Bubley and Dyer [7] showed that there is rapid mixing for $q = 2\Delta$ and Molloy [20] improved this to optimal mixing. In a breakthrough result Vigoda [29] showed rapid mixing for $q > (11/6)\Delta$. More recently Dyer, Greenhill and Molloy [11] exhibited optimal mixing for $q > (2 - \varepsilon)\Delta$ for a small positive constant $\varepsilon$.

In this paper our aim is to explore the limitations of Glauber dynamics as a 
sampling tool by exhibiting a class of graphs for which the mixing time is essentially 
as far from optimal as possible. In this direction, Łuczak and Vigoda [19] have exhibited families of planar graphs for which $\mathcal{M}_q$ is not rapidly mixing for each fixed $q \geq 3$ and families of bipartite graphs with maximum degree $\Delta$ for which $\mathcal{M}_q$ is not rapidly mixing for any $3 \leq q \leq O(\Delta / \log \Delta)$. A drawback of these negative results is that the families exhibited consist of random graphs. Here, we attempt to remedy this by constructing explicit families of graphs for which Glauber dynamics is inefficient. We focus exclusively on the case $q = 3$ (we cannot see at the moment how to apply our techniques to any $q > 3$) and $\Sigma$ regular bipartite. Specifically, we establish certain local and expansion conditions in a regular bipartite graph $\Sigma$ that force $\tau_{\mathcal{M}_3}$ to be (almost) exponential in $|V|$. The discrete hypercube is among the families of graphs which satisfy our conditions.

Our techniques actually apply to the class of $\rho$-local chains (considered in [6] and also in [TU], where the terminology $\rho|V|$-cautious is employed) for suitably small $\rho$. A Markov chain $\mathcal{M}$ on state space $\mathcal{C}_q$ is $\rho$-local if in each step of the chain at most $\rho|V|$ vertices have their colour changed; that is, if

$$P_{\mathcal{M}}(\chi_1, \chi_2) \neq 0 \Rightarrow |\{v \in V : \chi_1(v) \neq \chi_2(v)\}| \leq \rho|V|.$$
Before stating our main result, we establish some notation. From now on, $\Sigma = (V, E)$ will be a $d$-regular bipartite graph with partition classes $\mathcal{E}$ and $\mathcal{O}$. For $u, v \in V$ we write $u \sim v$ if there is an edge in $\Sigma$ joining $u$ and $v$. Set $N(u) = \{w \in V : w \sim u\}$ ($N(u)$ is the neighbourhood of $u$) and for $A \subseteq V$ set $N(A) = \cup_{w \in A} N(w)$. For $A \subseteq \mathcal{E}$ (or $A \subseteq \mathcal{O}$) set

$$[A] = \{x \in V : N(x) \subseteq N(A)\}$$

(we think of $[A]$ as an external closure of $A$) and say that such an $A$ is small if $|[A]| \leq |V|/4$. Note that $N(A)$ determines $[A]$ but not $A$ itself.

Define the bipartite expansion of $\Sigma$ by

$$\delta(\Sigma) = \min\left\{\frac{|N(A)| - |[A]|}{|N(A)|} : A \subseteq \mathcal{E} \text{ (or } \mathcal{O}\text{) small}, A \neq \emptyset\right\};$$

note that $0 \leq \delta \leq 1$. The second inequality is clear. To see the first, note that since $\Sigma$ is regular and bipartite it has a perfect matching, and so satisfies

$$|X| \leq |N(X)| \text{ for all } X \subseteq \mathcal{E} \text{ or } \mathcal{O}. \qquad (1)$$

That $0 \leq \delta$ now follows from $|[A]| \leq |N([A])| = |N(A)|$. The bipartite expansion constant is a measure of the proportion by which the neighbourhood size of a small set exceeds the size of the set itself, in the worst case.

Finally, define the locality $\ell(\Sigma)$ of $\Sigma$ to be the largest $\ell \geq 0$ such that for all $x \sim y \in V$ and for all independent sets $I$ (sets of vertices spanning no edges) in the subgraph of $\Sigma$ induced by $N(x) \cup N(y)$ we have $|I| \leq 2d - \ell$. (So, for example, if $\Sigma$ is the $d$-regular tree then $\ell(\Sigma) = 2$ since the subgraph induced by the neighbourhoods of adjacent vertices contains an independent set of size $2d - 2$; whereas if $\Sigma$ is the complete $d$-regular bipartite graph then $\ell(\Sigma) = d$.)
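The definition of locality can be checked by brute force on small graphs. The sketch below is an illustration only: the function names are chosen here and are not from the paper, and the exhaustive search over subsets is feasible only for very small $d$. It computes $2d$ minus the largest independent set found in any subgraph induced by $N(x) \cup N(y)$ with $x \sim y$.

```python
from itertools import combinations


def hypercube(d):
    """Adjacency lists of Q_d; vertex v is an integer read as a d-bit string."""
    return {v: [v ^ (1 << i) for i in range(d)] for v in range(1 << d)}


def complete_bipartite(d):
    """Adjacency lists of K_{d,d} with classes {0..d-1} and {d..2d-1}."""
    adj = {v: list(range(d, 2 * d)) for v in range(d)}
    adj.update({v: list(range(d)) for v in range(d, 2 * d)})
    return adj


def max_independent(vertices, adj):
    """Size of a largest independent set inside `vertices` (exhaustive search)."""
    vs = sorted(vertices)
    for size in range(len(vs), 0, -1):
        for subset in combinations(vs, size):
            chosen = set(subset)
            if all(w not in chosen for u in subset for w in adj[u]):
                return size
    return 0


def locality(adj, d):
    """Largest l with |I| <= 2d - l for every independent I in N(x) ∪ N(y), x ~ y."""
    worst = max(
        max_independent(set(adj[x]) | set(adj[y]), adj)
        for x in adj
        for y in adj[x]
    )
    return 2 * d - worst


if __name__ == "__main__":
    for d in (2, 3, 4):
        print(d, locality(hypercube(d), d))
    print("K_{3,3}:", locality(complete_bipartite(3), 3))
```

For the hypercube and for the complete bipartite graph the computed value agrees with the value $d$ stated in the text.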

Our main result is the following. Recall that $H(x) = -x \log x - (1 - x) \log(1 - x)$ is the usual binary entropy function.

Theorem 1.1 Fix $\rho > 0$ satisfying $H(\rho) + \rho < 1$. There are constants $d_0, C_1, C_1', C_2 > 0$ all depending on $\rho$ such that if $\Sigma$ is a $d$-regular bipartite graph on $N$ vertices with bipartite expansion $\delta$ and locality $\ell$ satisfying

$$\delta \geq \max\left\{\frac{C_1 \log^2 d}{d}, \frac{C_1' \log d}{\ell}\right\} \qquad (2)$$

and with $d \geq d_0$ and if $\mathcal{M}(\Sigma)$ is an ergodic $\rho$-local Markov chain on state space $\mathcal{C}_3(\Sigma)$ with stationary distribution $\pi_3(\Sigma)$ then

$$\tau_{\mathcal{M}(\Sigma)} \geq \exp_2\left\{\frac{C_2 N \delta}{\log d}\right\}.$$

Note that for all $\rho \leq .22$ we have $H(\rho) + \rho < 1$. Here and throughout we use "log" for $\log_2$ and write $\exp_2 x$ for $2^x$.
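The numerical claim about $\rho \leq .22$ is easy to check directly. The sketch below (function names are illustrative, not from the paper) evaluates the binary entropy function with base-2 logarithms and tests the condition $H(\rho) + \rho < 1$.

```python
import math


def binary_entropy(x):
    """H(x) = -x log2 x - (1 - x) log2 (1 - x), with H(0) = H(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)


def admissible(rho):
    """True if rho satisfies the hypothesis H(rho) + rho < 1."""
    return binary_entropy(rho) + rho < 1


if __name__ == "__main__":
    for rho in (0.20, 0.22, 0.23):
        print(rho, round(binary_entropy(rho) + rho, 4), admissible(rho))
```

Since $H(\rho) + \rho$ is increasing on $(0, 1/2)$, the check at $.22$ covers all smaller positive $\rho$; the condition already fails at $\rho = .23$.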
Remark 1.2 The second inequality in (2) implies $\ell \geq \Omega(\log d / \delta)$. This condition appears in the derivation of (10) where it is only used in the weaker form $\ell = \omega(1)$ (which follows since $\delta \leq 1$). It is used in a more essential way in the derivation of (16) where it serves to limit, somewhat artificially, the number of 3-colourings of a bipartite graph with a given pre-image of 0. We expect that Theorem 1.1 should remain true with the second inequality in (2) removed.

We now return to Glauber dynamics. This changes the colour of at most one vertex at each step, and so (as long as the underlying graph has at least five vertices) it is a $\rho$-local chain for $\rho = .2$. Fixing $\rho$ to this value, all of the constants in Theorem 1.1 become absolute, and we have the following corollary.



Corollary 1.3 There are constants $d_0, C_1, C_1', C_2 > 0$ such that if $\Sigma$ satisfies the conditions of Theorem 1.1 and if the Glauber dynamics chain $\mathcal{M}_3(\Sigma)$ is ergodic then

$$\tau_{\mathcal{M}_3(\Sigma)} \geq \exp_2\left\{\frac{C_2 N \delta}{\log d}\right\}.$$

Let us apply Theorem 1.1 to the case $\Sigma = Q_d$, the $d$-dimensional Hamming cube. This is the graph on vertex set $\{0, 1\}^d$ with $x \sim y$ iff $x$ and $y$ differ on exactly one coordinate. For $d \geq 2$ we have $\ell(Q_d) = d$ (the graph induced by the union of the neighbourhoods of adjacent vertices is a perfect matching, so all independent sets are of size at most $d$), and $\delta(Q_d) = \Omega(1/\sqrt{d})$ (see e.g. [18, Lemma 1.3]). So the following is an immediate corollary of Theorem 1.1.

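Corollary 1.4 Fix $\rho > 0$ satisfying $H(\rho) + \rho < 1$. There is a constant $C = C(\rho) > 0$ such that for all $d \geq 2$, if $\mathcal{M}(Q_d)$ is an ergodic $\rho$-local Markov chain on state space $\mathcal{C}_3(Q_d)$ with stationary distribution $\pi_3(Q_d)$ then

$$\tau_{\mathcal{M}(Q_d)} \geq \exp_2\left\{\frac{C 2^d}{\sqrt{d} \log d}\right\}.$$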


In particular this result applies to the Glauber dynamics chain, although in this case 
it is not necessary to hypothesize ergodicity. 

Corollary 1.5 There is a constant $C > 0$ such that for all $d \geq 2$,

$$\tau_{\mathcal{M}_3(Q_d)} \geq \exp_2\left\{\frac{C 2^d}{\sqrt{d} \log d}\right\}.$$

Proof: In the presence of Corollary 1.4 it suffices to show that the chain $\mathcal{M}_3(Q_d)$ is ergodic. We will show that if $\chi_1$ is a 3-colouring of $Q_d$ with $\chi_1(v_0) = 0$ for some $v_0 \in \mathcal{E}$ then there is a sequence of steps in the Glauber dynamics chain that takes $\chi_1$ to a 2-colouring $\chi_2$ of $Q_d$ with $\chi_2(v) = 0$ for all $v \in \mathcal{E}$. This suffices, since it is clear that any one of the six 2-colourings of $Q_d$ can be reached from any other via steps in the chain.

We make use of a correspondence between proper 3-colourings of $Q_d$ and homomorphisms from $Q_d$ to $\mathbb{Z}$ that send $v_0$ to 0. Formally, set

$$\mathcal{F}^{v_0} = \{f : V \to \mathbb{Z} : f(v_0) = 0 \text{ and } x \sim y \Rightarrow |f(x) - f(y)| = 1\}.$$

(This set was introduced in [2] and further studied in [THE].) Then, as observed by Randall [23], there is a bijection from $\mathcal{F}^{v_0}$ to $\mathcal{C}_3^{v_0} := \{\chi \in \mathcal{C}_3 : \chi(v_0) = 0\}$ given by $f \to \Phi(f)$ where $\Phi(f)(v) = i$ iff $f(v) \equiv i \pmod 3$. Before verifying that this is indeed a bijection, we use the correspondence to establish the corollary.

For $f \in \mathcal{F}^{v_0}$ set $R(f) = \{f(v) : v \in V\}$. Now consider $\chi_1 \in \mathcal{C}_3^{v_0}$. If $|R(\Phi^{-1}(\chi_1))| = 2$, then we may take $\chi_2 = \chi_1$ and we are done. If $|R(\Phi^{-1}(\chi_1))| = k > 2$, then it suffices to exhibit a sequence of steps in the chain that takes $\chi_1$ to some $\chi_3 \in \mathcal{C}_3^{v_0}$ with $|R(\Phi^{-1}(\chi_3))| = k - 1$.

Without loss of generality we may assume that $\Phi^{-1}(\chi_1)$ takes on some strictly positive values. Let $\ell$ be the largest such value, and let $v \in V$ be any vertex satisfying $\Phi^{-1}(\chi_1)(v) = \ell$. Note that $\ell - 2 \in R(\Phi^{-1}(\chi_1))$. Let $f : V \to \mathbb{Z}$ be the function that agrees with $\Phi^{-1}(\chi_1)$ off $v$ and satisfies $f(v) = \ell - 2$. Since $\Phi^{-1}(\chi_1)(y) = \ell - 1$ for all $y \in N(v)$ and $v \neq v_0$ we have that $f \in \mathcal{F}^{v_0}$ and $\Phi(f) \in \mathcal{C}_3^{v_0}$ and that the Glauber dynamics chain permits a move from $\chi_1$ to $\Phi(f)$. But we also have that $R(f) \subseteq R(\Phi^{-1}(\chi_1))$ and $|\{w \in V : f(w) = \ell\}| < |\{w \in V : \Phi^{-1}(\chi_1)(w) = \ell\}|$, so that by repeating the above described procedure $m$ more times (where $m = |\{w \in V : f(w) = \ell\}|$) we arrive at the desired $\chi_3$.

It remains to verify that $\Phi$ is a bijection. That it is injective is clear. To see that it is surjective, consider $\chi' \in \mathcal{C}_3^{v_0}$. We shall construct from $\chi'$ an $f \in \mathcal{F}^{v_0}$ with $\Phi(f) = \chi'$ by setting $f(v_0) = 0$ and then extending $f$ level by level, where the $k$th level of $Q_d$ ($k = 0, \ldots, d$) is $\mathcal{L}_k := \{v \in V : \mathrm{dist}(v, v_0) = k\}$ (here we are using $\mathrm{dist}(\cdot, \cdot)$ for the usual graph distance). Note that for $v \in \mathcal{L}_k$, $N(v) \subseteq \mathcal{L}_{k-1} \cup \mathcal{L}_{k+1}$ and that for $f \in \mathcal{F}^{v_0}$ the values that $f$ takes on $\mathcal{L}_k$ must all have the same parity.

So suppose we have specified $f$ up to $\mathcal{L}_k$ for some $0 \leq k \leq d - 1$. Consider $v \in \mathcal{L}_{k+1}$. If $f$ is constant on $N(v) \cap \mathcal{L}_k$ then (since the construction of $f$ has succeeded up to $\mathcal{L}_k$) we also have that $\chi'$ is constant on $N(v) \cap \mathcal{L}_k$ with $\chi'(y) \equiv f(y) \pmod 3$ for all $y \in N(v) \cap \mathcal{L}_k$. In this case we choose $f(v)$ such that $|f(v) - f(y)| = 1$ for all $y \in N(v) \cap \mathcal{L}_k$ and $\chi'(v) \equiv f(v) \pmod 3$.

If $f$ is not constant on $N(v) \cap \mathcal{L}_k$, then we claim that there is some $\ell \in \mathbb{Z}$ such that $f$ takes on only the values $\ell$ and $\ell + 2$ on $N(v) \cap \mathcal{L}_k$. For if not, then we have $y_1, y_2 \in N(v) \cap \mathcal{L}_k$ with $|f(y_1) - f(y_2)| \geq 4$. But by the structure of $Q_d$ there must be $v' \in \mathcal{L}_{k-1}$ with $v' \sim y_1$ and $v' \sim y_2$ which forces $|f(y_1) - f(y_2)| \leq 2$. This contradiction establishes the two-value claim. We now set $f(v) = \ell + 1$, allowing the construction to continue. Since $\mathcal{L}_{k+1}$ is an independent set in $Q_d$, we may repeat the above-described procedure on each vertex of $\mathcal{L}_{k+1}$ independently, thus extending the construction of $f$ to all of $\mathcal{L}_{k+1}$. □
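The correspondence between 3-colourings fixing $v_0$ and height functions can be explored computationally on a small cube. The following sketch is an illustration under stated assumptions: it lifts a colouring by breadth-first search from $v_0 = 0$ rather than level by level as in the proof (the two orders give the same function when the lift exists), and the names `lift` and `colourings_fixing_zero` are chosen here.

```python
from collections import deque
from itertools import product


def hypercube(d):
    """Adjacency lists of Q_d; vertex v is an integer read as a d-bit string."""
    return {v: [v ^ (1 << i) for i in range(d)] for v in range(1 << d)}


def lift(chi, adj, v0=0):
    """Lift a proper 3-colouring chi with chi[v0] == 0 to f: V -> Z.

    f(v0) = 0, adjacent values differ by exactly 1, and f(v) ≡ chi[v] (mod 3).
    Returns None if the step-by-step lift is inconsistent on some edge.
    """
    f = {v0: 0}
    queue = deque([v0])
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            # Colours of neighbours differ by 1 or 2 mod 3; 2 means "step down".
            step = 1 if (chi[w] - chi[u]) % 3 == 1 else -1
            if w not in f:
                f[w] = f[u] + step
                queue.append(w)
            elif f[w] != f[u] + step:
                return None
    return f


def colourings_fixing_zero(d):
    """All proper 3-colourings of Q_d (as tuples) that give vertex 0 colour 0."""
    adj = hypercube(d)
    result = []
    for colours in product(range(3), repeat=1 << d):
        if colours[0] == 0 and all(
            colours[u] != colours[w] for u in adj for w in adj[u]
        ):
            result.append(colours)
    return result


if __name__ == "__main__":
    adj = hypercube(3)
    lifts = [lift(chi, adj) for chi in colourings_fixing_zero(3)]
    print(len(lifts), all(f is not None for f in lifts))
```

On $Q_3$ every one of the 38 colourings with $\chi(v_0) = 0$ lifts successfully, in line with the surjectivity argument above.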
Remark 1.6 Glauber dynamics for $q$-colourings of $Q_d$ is not in general ergodic for $3 < q \leq \Delta(Q_d) + 1$. Indeed, it is straightforward to construct a 4-colouring $\chi$ of $Q_3$ which is frozen in the sense that $P_4(\chi, \chi') = 0$ for all $\chi' \neq \chi$: one simply assigns the colours 0, 1, 2 and 3 to a particular vertex and its three neighbours and then extends to a colouring of the whole of $Q_3$ according to the rule that on each face (4-cycle) of $Q_3$ all of the colours 0, 1, 2 and 3 must appear.

Remark 1.7 While this paper was under review, Galvin and Randall [13] used methods different to those of the present work to extend Corollary 1.5 to the discrete torus $T_{L,d}$, the graph on vertex set $\{0, \ldots, L - 1\}^d$ in which two strings are adjacent if they differ on exactly one coordinate and differ by 1 (mod $L$) on that coordinate. The main result of [13] is that for $L \geq 4$ even and $d$ large, the Glauber dynamics chain $\mathcal{M}_3$ on $\mathcal{C}_3(T_{L,d})$ satisfies $\tau_{\mathcal{M}_3} \geq \exp\{L^{d-1}/(d^4 \log^2 L)\}$.

We prove Theorem 1.1 via a well-known conductance argument (introduced in [T6]). A particularly useful form of the argument was given by Dyer, Frieze and Jerrum [10]. Let $\mathcal{M}$ be an ergodic Markov chain on state space $\Omega$ with transition probabilities $P$ and stationary distribution $\pi$. Let $A \subseteq \Omega$ and $M \subseteq \Omega \setminus A$ satisfy $\pi(A) \leq 1/2$ and $\omega_1 \in A, \omega_2 \in \Omega \setminus (A \cup M) \Rightarrow P(\omega_1, \omega_2) = 0$. Then from [10] we have

$$\tau_{\mathcal{M}} \geq \frac{\pi(A)}{8\pi(M)}.$$

We may think of $M$ as a bottleneck set through which any run of the chain must pass in order to mix; if the bottleneck has small measure, then the mixing time is high.
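Now let us return to the setup of Theorem 1.1. Set

$$\mathcal{C}_3^{b,\rho,0} = \mathcal{C}_3^{b,\rho,0}(\Sigma) = \{\chi \in \mathcal{C}_3 : ||\chi^{-1}(0) \cap \mathcal{E}| - |\chi^{-1}(0) \cap \mathcal{O}|| \leq \rho N/2\}$$

($\mathcal{C}_3^{b,\rho,0}$ is the set of 3-colourings that are balanced with respect to 0) and

$$\mathcal{C}_3^{\mathcal{E},\rho,0} = \mathcal{C}_3^{\mathcal{E},\rho,0}(\Sigma) = \{\chi \in \mathcal{C}_3 : |\chi^{-1}(0) \cap \mathcal{E}| > |\chi^{-1}(0) \cap \mathcal{O}| + \rho N/2\}.$$

We may assume without loss of generality that $\pi_3(\mathcal{C}_3^{\mathcal{E},\rho,0}) \leq 1/2$. Notice that since $\mathcal{M}$ changes the colour of at most $\rho N$ vertices in each step, we have that if $\chi_1 \in \mathcal{C}_3^{\mathcal{E},\rho,0}$ and $\chi_2 \in \mathcal{C}_3 \setminus (\mathcal{C}_3^{\mathcal{E},\rho,0} \cup \mathcal{C}_3^{b,\rho,0})$ then $P_{\mathcal{M}}(\chi_1, \chi_2) = 0$. We therefore have

$$\tau_{\mathcal{M}} \geq \frac{\pi_3(\mathcal{C}_3^{\mathcal{E},\rho,0})}{8\pi_3(\mathcal{C}_3^{b,\rho,0})} \geq \frac{2^{N/2}}{8|\mathcal{C}_3^{b,\rho,0}|},$$

the second inequality coming from the trivial lower bound $|\mathcal{C}_3^{\mathcal{E},\rho,0}| \geq 2^{N/2}$ (consider those $\chi$ with $\chi(v) = 0$ for all $v \in \mathcal{E}$). Theorem 1.1 thus follows from the following theorem, whose proof will be the main business of this paper.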
Theorem 1.8 Fix $\rho > 0$ satisfying $H(\rho) + \rho < 1$. There are constants $d_0, C_1, C_1', C_2 > 0$ all depending on $\rho$ such that if $\Sigma$ is a $d$-regular bipartite graph on $N$ vertices with bipartite expansion $\delta$ and locality $\ell$ satisfying (2) and with $d \geq d_0$ then

$$|\mathcal{C}_3^{b,\rho,0}| \leq \exp_2\left\{\frac{N}{2}\left(1 - \frac{C_2 \delta}{\log d}\right)\right\}.$$

Theorem 1.8 says more about the structure of $\mathcal{C}_3$ than just that the dynamics mixes slowly. From it, we can infer that for $\Sigma$ and $\rho$ satisfying the conditions of the theorem, $\mathcal{C}_3$ breaks naturally into six sets in such a way that once a $\rho$-local chain enters one of these dominant sets, it tends to remain there for an exponential time. These sets are characterized by a predominance of one (of three) colours on one (of two) partition classes. Indeed, defining $\mathcal{C}_3^{b,\rho,1}$ and $\mathcal{C}_3^{b,\rho,2}$ by analogy with $\mathcal{C}_3^{b,\rho,0}$ and setting $\mathcal{R}_3 = \mathcal{C}_3 \setminus \cup_{i=0}^{2} \mathcal{C}_3^{b,\rho,i}$, we may partition $\mathcal{R}_3$ into six pieces by

$$\mathcal{R}_3 = \bigcup_{(x,y,z) \in \{\mathcal{E},\mathcal{O}\}^3 \setminus \{(\mathcal{E},\mathcal{E},\mathcal{E}),(\mathcal{O},\mathcal{O},\mathcal{O})\}} \mathcal{R}_3^{(x,y,z)}$$

where $\mathcal{R}_3^{(x,y,z)} = \{\chi \in \mathcal{R}_3 : \chi \in \mathcal{C}_3^{x,\rho,0} \cap \mathcal{C}_3^{y,\rho,1} \cap \mathcal{C}_3^{z,\rho,2}\}$. If a $\rho$-local chain leaves $\mathcal{R}_3^{(x,y,z)}$ (for any $(x,y,z)$) it must enter $\cup_{i=0}^{2} \mathcal{C}_3^{b,\rho,i}$ which, by Theorem 1.1 and a union bound, has exponentially small measure.

Before turning to the proof of Theorem 1.8 we pause to give a pleasing combinatorial corollary in the special case $\Sigma = Q_d$.

Corollary 1.9 Fix $\rho$ satisfying $H(\rho) + \rho < 1$. There is a constant $C = C(\rho) > 0$ such that for all $d \geq 2$, if $\chi$ is a uniformly chosen 3-colouring of $Q_d$ then

$$P\left(\exists i : \left|\frac{|\chi^{-1}(i) \cap \mathcal{E}|}{|\mathcal{E}|} - \frac{|\chi^{-1}(i) \cap \mathcal{O}|}{|\mathcal{O}|}\right| \leq \rho\right) \leq \exp_2\left\{-\frac{C 2^d}{\sqrt{d} \log d}\right\}.$$

In other words, the typical 3-colouring of $Q_d$ exhibits strong $\mathcal{E}/\mathcal{O}$ imbalance on all colours.

Proof of Corollary 1.9: As previously observed, $\ell(Q_d) = d$ and $\delta(Q_d) \geq \Omega(1/\sqrt{d})$, so (2) is satisfied for large enough $d$. It follows that there is a $C'(\rho)$ such that for large enough $d$ and for each $i = 0, 1, 2$,

$$|\{\chi \in \mathcal{C}_3(Q_d) : ||\chi^{-1}(i) \cap \mathcal{E}| - |\chi^{-1}(i) \cap \mathcal{O}|| \leq \rho 2^{d-1}\}| \leq \exp_2\left\{2^{d-1} - \frac{C' 2^d}{\sqrt{d} \log d}\right\}.$$

Using $2^{2^{d-1}}$ as a lower bound on $|\mathcal{C}_3(Q_d)|$ (consider those colourings for which $\chi^{-1}(0) = \mathcal{E}$) we obtain

$$P\left(||\chi^{-1}(i) \cap \mathcal{E}| - |\chi^{-1}(i) \cap \mathcal{O}|| \leq \rho 2^{d-1}\right) \leq \exp_2\left\{-\frac{C' 2^d}{\sqrt{d} \log d}\right\}$$

for $\chi$ chosen uniformly from $\mathcal{C}_3(Q_d)$. The stated bound follows for large $d$ (with a constant $C''$ slightly larger than $C$) via a union bound and the fact that $|\mathcal{E}| = |\mathcal{O}| = 2^{d-1}$; we may obtain the bound for all $d$ by appropriately modifying the constant $C''$.

□ 
2 Proof of Theorem 1.8



2.1 Overview of the proof 

In this section we give an informal overview of the proof of Theorem 1.8.

We bound the number of balanced 3-colourings by bounding, for each pair $E \subseteq \mathcal{E}$, $O \subseteq \mathcal{O}$ with $||E| - |O|| \leq \rho N/2$ and $E \not\sim O$ (that is, with no edge in $\Sigma$ joining $E$ and $O$), the number of 3-colourings of $\Sigma$ in which $E \cup O$ is the pre-image of 0. We then sum over all choices of $E$ and $O$.

How many ways are there to 3-colour $\Sigma$ given that $E \cup O$ is the pre-image of 0? Write $I(E)$ for the set of vertices in $N(E)$ all of whose neighbours are in $E$, and $I(O)$ for the neighbours of $O$ all of whose neighbours are in $O$. There are two choices for each vertex in $I(E)$ and two for each vertex in $I(O)$, as well as two choices for each component in the graph obtained from $\Sigma$ by removing $E$, $O$, $I(E)$ and $I(O)$ (each such component is a connected bipartite graph), all choices independent. The first step in the proof is an easy graph theory lemma that shows that the contribution from components in $\Sigma - (E \cup O \cup I(E) \cup I(O))$ is negligible. (This step uses the locality of $\Sigma$ in an essential way.) This reduces the problem of bounding the number of balanced 3-colourings to the problem of estimating a sum of the form

$$\sum_{E, O : E \not\sim O} 2^{|I(E)| + |I(O)|}. \qquad (3)$$

When $|E|$ and $|O|$ are both small (less than $cN$ for a suitably small constant $c$) a naive count suffices to give an appropriate bound. For larger $E$ and $O$, we must work harder. We partition the set of pairs $(E, O)$ according to the parameters $a = |[E]|$, $g = |N(E)|$, $b = |I(E)|$, $h = |N(I(E))|$, $b' = |I(O)|$ and $h' = |N(I(O))|$. Within each class, each pair gives the same contribution ($2^{b+b'}$) to the sum in (3). The main point of the proof is an estimate on the size of $\mathcal{H} = \{(E, O) : (E, O)$ has parameters $a$, $g$, $b$, $h$, $b'$ and $h'\}$ of the form

for sufficiently large $d = d(\rho)$ and suitable $c = c(\rho)$. The proof is completed by invoking (3) and summing over all choices of $a$, $g$, et cetera.

The proof of (4) involves the idea of approximation. To bound $|\mathcal{H}|$, we produce a small set $\mathcal{U}$ with the properties that each $(E, O) \in \mathcal{H}$ is approximated (in an appropriate sense) by some $U \in \mathcal{U}$, and for each $U \in \mathcal{U}$, the number of $(E, O) \in \mathcal{H}$ that could possibly be approximated by $U$ is small. (Each $U \in \mathcal{U}$ will consist of six parts; one each approximating $E$, $N(E)$, $I(E)$, $N(I(E))$, $I(O)$ and $N(I(O))$.) The product of the bound on $|\mathcal{U}|$ and the bound on the number of those $(E, O) \in \mathcal{H}$ that may be approximated by any $U$ is then a bound on $|\mathcal{H}|$.
The main inspiration for our approximation scheme is the work of A. Sapozhenko, who, in [25], gave a relatively simple derivation for the asymptotics of the number of independent sets in $Q_d$, earlier derived in a more involved way in [18]. We produce the set $\mathcal{U}$ by appealing to a lemma from [13] where a similar approximation scheme was used to show that the mixing time of Glauber dynamics for the hard-core model on $Q_d$ with activity $\lambda$ is (essentially) exponential in $2^d$ for large enough $\lambda$. The proof that each $U \in \mathcal{U}$ approximates only a small number of $(E, O) \in \mathcal{H}$ is a modification of a similar proof from [12] in which it is shown that a uniformly chosen homomorphism
from Qd to Z almost surely takes on at most 5 values, and also that the number of 
proper 3-colourings of Qd is asymptotic to 2e2 2 as d goes to infinity. 

2.2 The proof 

We begin by establishing some more notation. From now on, we write $M$ for $N/2$. For $A \subseteq \mathcal{E}$ and $B \subseteq \mathcal{O}$ write $A \not\sim B$ if for all $x \in A$ and $y \in B$, $x \not\sim y$ (this is equivalent to both $N(A) \cap B = \emptyset$ and $N(B) \cap A = \emptyset$). For $S \subseteq V$ write $d_S(u)$ for $|N(u) \cap S|$ and $\mathrm{comp}(S)$ for the number of components of the subgraph induced by $S$. Finally for $T \subseteq \mathcal{E}$ (or $\mathcal{O}$) set

$$I(T) = \{x \in N(T) : N(x) \subseteq T\} \ (= \{x \in V : N(x) \subseteq T\}).$$

We think of $I(T)$ as an internal closure of $N(T)$. Note that for all $T \subseteq \mathcal{E}$ (or $\mathcal{O}$), $[I(T)] = I(T)$, $I(T) \subseteq N(T)$ and $N(I(T)) \subseteq T$.

For $\chi \in \mathcal{C}_3^{b,\rho,0}$ set

$$E = E(\chi) = \chi^{-1}(0) \cap \mathcal{E},$$
$$O = O(\chi) = \chi^{-1}(0) \cap \mathcal{O},$$
$$I = I(E),$$
$$J = I(O)$$

and

$$R = V \setminus (E \cup O \cup I \cup J).$$

We assume the convention that whenever $E$ and $O$ have been specified, $I$, $J$ and $R$ will be used as shorthand for $I(E)$, $I(O)$ and $V \setminus (E \cup O \cup I \cup J)$.

For $E \subseteq \mathcal{E}$ and $O \subseteq \mathcal{O}$ set

$$\mathcal{C}_3(E, O) = \{\chi \in \mathcal{C}_3 : E(\chi) = E, O(\chi) = O\}.$$

Note that $\mathcal{C}_3(E, O) \neq \emptyset$ iff $E \not\sim O$. For $\mathcal{C}_3(E, O) \neq \emptyset$ we have

$$|\mathcal{C}_3(E, O)| = 2^{|I| + |J| + \mathrm{comp}(R)}.$$

To see this, note that once we have specified that the set of vertices coloured 0 is $E \cup O$, we have a free choice between 1 and 2 for the colour at $x \in I \cup J$, with each choice independent. This accounts for the factor $2^{|I| + |J|}$. The subgraph induced by $R$ breaks into $\mathrm{comp}(R)$ components, each of which is bipartite and may be coloured in exactly two ways using the colours 1 and 2. This accounts for the factor $2^{\mathrm{comp}(R)}$. We therefore have

$$|\mathcal{C}_3^{b,\rho,0}| = \sum_{\substack{E \subseteq \mathcal{E},\ O \subseteq \mathcal{O}: \\ ||E| - |O|| \leq \rho M,\ E \not\sim O}} 2^{|I| + |J| + \mathrm{comp}(R)}.$$
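The counting identity for the number of 3-colourings with a prescribed pre-image of 0, namely 2 raised to the power $|I| + |J| + \mathrm{comp}(R)$, can be verified exhaustively on a small cube. The sketch below is illustrative only: the function names are chosen here, the enumeration over all $3^{2^d}$ assignments is feasible only for $d \leq 3$, and $I \cup J$ is computed in one step as the set of vertices all of whose neighbours are coloured 0.

```python
from collections import defaultdict
from itertools import product


def hypercube(d):
    """Adjacency lists of Q_d; vertex v is an integer read as a d-bit string."""
    return {v: [v ^ (1 << i) for i in range(d)] for v in range(1 << d)}


def components(vertices, adj):
    """Number of connected components of the subgraph induced by `vertices`."""
    left, count = set(vertices), 0
    while left:
        stack = [left.pop()]
        count += 1
        while stack:
            u = stack.pop()
            for w in adj[u]:
                if w in left:
                    left.remove(w)
                    stack.append(w)
    return count


def check_count_formula(d):
    """Group proper 3-colourings of Q_d by the pre-image of 0 and test the formula.

    Returns (formula_holds, largest comp(R) seen, total number of colourings).
    """
    adj = hypercube(d)
    V = sorted(adj)
    classes = defaultdict(int)
    for colours in product(range(3), repeat=len(V)):
        if all(colours[u] != colours[w] for u in V for w in adj[u]):
            classes[frozenset(v for v in V if colours[v] == 0)] += 1
    holds, max_comp = True, 0
    for zero, count in classes.items():
        closed = {x for x in V if set(adj[x]) <= zero}  # this is I ∪ J
        rest = set(V) - zero - closed                   # this is R
        comp = components(rest, adj)
        max_comp = max(max_comp, comp)
        if count != 2 ** (len(closed) + comp):
            holds = False
    return holds, max_comp, sum(classes.values())


if __name__ == "__main__":
    print(check_count_formula(3))
```

For $Q_3$ (where $M = 4$ and $\ell = 3$) the largest value of $\mathrm{comp}(R)$ observed also respects the bound $2M/\ell$ of Proposition 2.1.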



A key observation is the following.

Proposition 2.1 For $E \not\sim O$, $\mathrm{comp}(R) \leq 2M/\ell$.

Proof: Let $C$ be a component of $V \setminus (E \cup O)$. If $C = \{v\}$ consists of a single vertex, then (depending on the parity of $v$) we have either $N(v) \subseteq E$ or $N(v) \subseteq O$ and so $v \in I(E) \cup I(O)$. Otherwise, let $vw$ be an edge of $C$. We have

$$|(N(v) \cup N(w)) \cap (E \cup O)| \leq 2d - \ell$$

(recall that $E \cup O = \chi^{-1}(0)$ is an independent set), and so $|C| \geq \ell$. The result follows. □

We now decompose $\mathcal{C}_3^{b,\rho,0}$ into four pieces. Set

:2a' + p + H(a') + H(p + a') < i (1 + p + . (5) 

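Since $H(\rho) + \rho < 1$, $\alpha$ is a strictly positive constant depending on $\rho$. Set

$$\mathcal{C}_3^{b,\rho,0}(triv, \mathcal{E}) = \{\chi \in \mathcal{C}_3^{b,\rho,0} : |E| \leq \alpha M, |E| \leq |O|\}$$

and define $\mathcal{C}_3^{b,\rho,0}(triv, \mathcal{O})$ analogously. Set

$$\mathcal{C}_3^{b,\rho,0}(nt, \mathcal{E}) = \{\chi \in \mathcal{C}_3^{b,\rho,0} \setminus (\mathcal{C}_3^{b,\rho,0}(triv, \mathcal{E}) \cup \mathcal{C}_3^{b,\rho,0}(triv, \mathcal{O})) : E \text{ small}\}$$

(recall that $E$ is small if $|[E]| \leq M/2$) and define $\mathcal{C}_3^{b,\rho,0}(nt, \mathcal{O})$ similarly. Since $\Sigma$ has a perfect matching, it is easy to see that for $\chi \in \mathcal{C}_3$ at least one of $|E| \leq M/2$, $|O| \leq M/2$ holds; moreover, it is straightforward to check that at least one of $|[E]| \leq M/2$, $|[O]| \leq M/2$ holds also; that is, that at least one of $E$, $O$ is small, and so

$$\mathcal{C}_3^{b,\rho,0} = \mathcal{C}_3^{b,\rho,0}(triv, \mathcal{E}) \cup \mathcal{C}_3^{b,\rho,0}(triv, \mathcal{O}) \cup \mathcal{C}_3^{b,\rho,0}(nt, \mathcal{E}) \cup \mathcal{C}_3^{b,\rho,0}(nt, \mathcal{O}).$$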

In what follows we make extensive use of a result concerning the sums of binomial coefficients which follows from the Chernoff bounds [S] (see also [3], p. 11):

$$\sum_{i=0}^{[\beta M]} \binom{M}{i} \leq 2^{H(\beta)M} \text{ for } \beta \leq \frac{1}{2}. \qquad (6)$$
Also, since H(x) < 2xlogl/x for x < e 1

[/?M] 



(7) 



We begin by bounding $|\mathcal{C}_3^{b,\rho,0}(triv, \mathcal{E})|$. Noting that $|I| \leq |E|$ and $|J| \leq |O|$ always (this follows from (1)) we have



\C b /'°(tmv,£)\ = 2l B l+l°l +com P^ 

ECE, OCO: 
\E\<aM, \0\<(a.+p)M 

< expJ^f} £ 2l " l+ ' 01 

^ ) ECE. OCO: 



ECE, OCO: 
\E\<aM, \0\<(a+p)M 



i=0 x 7 i=0 

< exp 2 |M r2a + p + fT(a)+fl"(p + a) + jjj (9) 

s ^{K 1 -^)} (io) 

for sufficiently large $d = d(\rho)$. In (8) we have used Proposition 2.1. In (9) we use (6) while in (10) we use (5) to obtain

2 2 

2a + p + If (a) + if (p + a) + -<l-e + - 

for some $\varepsilon = \varepsilon(\alpha)$, and then use the second inequality in (2) (in the weak form that $\ell = \omega(1)$) to obtain

2 5 
(note that S/logd = o(l)). Similarly, we have 

| C r (i ™,, )|<exp 2 {f (l-i^)} (ID 

for suitable d. 

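Next we turn to $\mathcal{C}_3^{b,\rho,0}(nt, \mathcal{E})$ and $\mathcal{C}_3^{b,\rho,0}(nt, \mathcal{O})$. Without loss of generality we may assume $|\mathcal{C}_3^{b,\rho,0}(nt, \mathcal{E})| \leq |\mathcal{C}_3^{b,\rho,0}(nt, \mathcal{O})|$. Bearing (10) and (11) in mind, Theorem 1.8 now follows from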

|^W)l<exp 2 {f (l-^)} (12) 
for some constant c = c(p). 
For integers a, g, b, h, b' and h', set

ECS, OCO, E^O, \[E}\ = a, \N(E)\ = g, 



H{a,g,b,hXh') = ^E,0): |f| = ^ |]V(J)| = ^ , J{ = fo , |jV(J)| = h> 
Our main lemma is the following (cf. [14, Theorem 2.1]). 

Lemma 2.2 For each $\beta_1, \beta_2 > 0$, there are constants $d_0, c > 0$ depending on both $\beta_1$ and $\beta_2$ such that the following holds. If $G$ is a $d$-regular bipartite graph with bipartite expansion $\delta \geq d^{-\beta_1}$ and $d \geq d_0$ and if $a$ satisfies $\beta_2 M \leq a \leq M/2$, then for any $g$, $b$, $h$, $b'$ and $h'$ we have

\H(a, g, 6, h, b', h>) \ < exp 2 {m (l + ^f^) - b - b' - ^} . 

For $G = \Sigma$ we may take $\beta_1 = 2$ (say). Note that for each $(E, O) \in \mathcal{C}_3^{b,\rho,0}(nt, \mathcal{E})$ with $|I| = b$ and $|J| = b'$, $(E, O) \in \mathcal{H}(a, g, b, h, b', h')$ for some $a$, $g$, $h$ and $h'$ with $\alpha M \leq a \leq M/2$. With the steps justified below, we therefore have

\C b 3 ' p '°(nt,£)\ < expJ^j £ \H(a,g,b,h,b',ti)\2 b+b ' 

^ a,g,b,h,b',h': 



g,t> 

ctM<a<M/2 



aM<a<M/2 

2 



< exp 2 {M ( 1 + \ + ) m a X (exp 2 1-^2- \ \ (14) 



^ J a M<a<M/2 [ { hgd 



a<g 



< e xP2 {M( 1 + | + ^-^)} 

verifying (12) and completing the proof of Theorem 1.8. The main point, (13), is an application of Lemma 2.2. Here the constant $c$ depends on $\alpha$ and therefore on $\rho$. In (14) we use that $M \leq \exp\{M \log^2 d / d\}$ for all $d$. In (15) we have chosen $a = \alpha M$ to maximize the exponent. Finally in (16) we may (for example) take $C_1 = 43/(\alpha c)$ and $C_1' = 4/(\alpha c)$, and use both inequalities in (2). The final constant $d_0$ depends only on $c$ and $\alpha$ and therefore only on $\rho$, as claimed.



To prove Lemma 2.2 we use a notion of approximation introduced in [25]. An approximation for $A \subseteq \mathcal{E}$ is a pair $(F, S) \subseteq \mathcal{O} \times \mathcal{E}$ satisfying

$$F \subseteq N(A), \quad S \supseteq [A],$$

$$d_F(u) \geq d - \sqrt{d} \quad \forall u \in S$$

and

$$d_{\mathcal{E} \setminus S}(v) \geq d - \sqrt{d} \quad \forall v \in \mathcal{O} \setminus F.$$

For $A \subseteq \mathcal{O}$ we make the analogous definition.

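The following lemma is from [14] (a combination of Lemmata 3.2 and 3.3). We use the shorthand $\binom{n}{\leq k}$ for $\sum_{0 \leq i \leq k} \binom{n}{i}$.

Lemma 2.3 Let $G$ be a $d$-regular bipartite graph with $2M$ vertices. For each $a$ and $g$ set

$$\mathcal{A}(a, g) = \{A \subseteq \mathcal{E} : |[A]| = a, |N(A)| = g\}.$$

There is a family $\mathcal{W} = \mathcal{W}(a, g) \subseteq 2^{\mathcal{O}} \times 2^{\mathcal{E}}$ with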



|W| < 



M \ /2# log d\ f2d 3 g log d\ / 2g log d 



such that every $A \in \mathcal{A}(a, g)$ has an approximation in $\mathcal{W}$. The analogous result holds with $\mathcal{O}$ replacing $\mathcal{E}$ in the definition of $\mathcal{A}(a, g)$.

Remark 2.4 If a and g satisfy g — a > d~°g for some constant f3 then using ([?]) the 
bound on |W| from Lemma \2. 3\ may be rewritten as 

f 5Mlog 2 d (6/3 + 17) - a) logd 
|W| < exp 2 <^ h 



d 

as long as d is sufficiently large (as a function of j3). 

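Say that a sextuple $(F, S, P, Q, P', Q') \subseteq \mathcal{O} \times \mathcal{E} \times \mathcal{E} \times \mathcal{O} \times \mathcal{O} \times \mathcal{E}$ is an approximation for $(E, O) \in \mathcal{H}(a, g, b, h, b', h')$ if $(F, S)$ is an approximation for $E$, $(P, Q)$ is an approximation for $I$ and $(P', Q')$ is an approximation for $J$.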

Lemma 2.5 Let G, a, g, b, h, b' and h! be as in Lemma 0O1 There are constants 
c i = Ci(Pi) > and c 2 = c 2 ((3i, (3 2 ) > and a family X = X(a, g,b, h,b' , h') C 
2° x 2 £ x 2 f x 2° x 2° x 2 s with 

|15Mlog 2 d ci(p-a)logd Ci(/i-6)logd c 2 (h' - b') logd 

w - exP2 \ — d — + — vs — + — 7^ — + Td — 

such that every (E, O) G "H(a, g, b, h, b', h!) has an approximation in X . 

Proof: We apply Lemma 12.31 (in the form given in Remark 12 .4[) to each of E, I and J 
independently. Note that g—a> d~ /3l g and h—b> d~^h follow from the assumptions 
on 5 in Lemma O (recall \[E}\ = a < M/2 and |[Pj| = |/| < \E\ < M/2), justifying 
the first two applications of Lemma 12.31 and the dependence of c\ on fix alone. 
For the third application, note that if b' < M/2 we have hi — b' > d 131 hi (recall
| [J] | = |J|). If V > M/2, then \0 \ N(J)\ < M/2. Since [J] = J we also have 
[O \ N(J)\ = O \ N(J) and N(0 \ N(J)) = £ \ J and so (by the bound on 5) 
(M-b')-(M-h') > d-^(M-b'). Using hi > b' it follows that h'-b' > d~^{M-h'). 
But since N(J) n N(E) = we have hi < M - g < M - a < (1 - /3 2 )M and so 
hi — b' > rf _/3l (l/(l — 2 ) — 1)^' > d~ c h', where the constant c depends on both /3i 
and fi 2 - D 

Before going on to the final step in the proof of Lemma |2T2| we need the following 
simple inequalities ( [TH Lemma 3.1] in the case if> = Vd). K (F,S,P,Q,P',Q') is an 
approximation for (2£, O) E H(a, g, b, h, b', h!) then for suitably large d 

| S |<|F| + fcf>, |«|<|P| + fc« and | Q1 < n + ^. (17 ) 
vrf v ct yd 

Bearing this and the fact that g — a > 5g in mind, Lemma 12.21 is implied by Lemma 
and the following reconstruction lemma. 



Lemma 2.6 Let G, a, g, b, h, b' and h' be as in Lemma \2.S[ There are constants 
c\ = ci(/?i) > and c 2 = c 2 (f3i, f3 2 ) > such that for each (F, S, P,Q,P',Q') C 
Ox£x£xOxOx£ satisfying (Tfy there are at most 



!m t, h> c ^9-a) cjjh-b) c 2 {h'-b') \ 

6Xp 2 <M~b—b : : : : > 

I logct log d log a J 

pairs (E, O) C £ x O satisfying 

$$F \subseteq N(E), \ S \supseteq [E], \ P \subseteq N(I), \ Q \supseteq I, \ P' \subseteq N(J) \text{ and } Q' \supseteq J. \qquad (18)$$

Proof: For notational convenience, write $t$ for $g - a$, $s$ for $h - b$ and $s'$ for $h' - b'$.
Say that $S$ is tight if $|S| < g - c'_1 t/\log d$ and slack otherwise, that $Q$ is tight if
$|Q| < b + c'_1 s/\log d$, and slack otherwise, and that $Q'$ is tight if $|Q'| < b' + c'_2 s'/\log d$,
and slack otherwise, where $c'_1 = c'_1(\beta_1) > 0$ and $c'_2 = c'_2(\beta_1, \beta_2) > 0$ are constants that
will be specified presently.

We now describe a procedure which, for input $(F, S, P, Q, P', Q')$ satisfying (17),
produces an output $(E, O)$ which satisfies (18). The procedure involves a sequence
of choices, the nature of the choices depending on whether $S$, $Q$ and $Q'$ are tight or
slack.

We begin by identifying a subset $D$ of $E$ which can be specified relatively cheaply:
if $Q$ is tight, we pick $I \subseteq Q$ with $|I| = b$ and take $D = N(I)$; if $Q$ is slack, we simply
take $D = P$ (recalling that $P \subseteq N(I) \subseteq E$).

If $S$ is tight, we complete the specification of $E$ by choosing $E \setminus D \subseteq S \setminus D$. If $S$ is
slack, we first complete the specification of $N(E)$ by choosing $N(E) \setminus F \subseteq N(S) \setminus F$.
We then complete the specification of $E$ by choosing $E \setminus D \subseteq [E] \setminus D$ (noting that
we do know $[E] \setminus D$ at this point).
Next we turn to the specification of $O$. As with $E$, we begin by identifying a
subset $D'$ of $O$: if $Q'$ is tight, we pick $J \subseteq Q'$ with $|J| = b'$ and take $D' = N(J)$; if
$Q'$ is slack, we simply take $D' = P'$. From here, we complete the specification of $O$
by choosing $O \setminus D' \subseteq \mathcal{O} \setminus (N(E) \cup D')$ (recall that $E \not\sim O$).

This procedure produces all pairs $(E, O)$ satisfying (18). Before bounding the
number of outputs, we gather together some useful observations.

First note that as established in the proof of Lemma 2.5 we have

$$g - a \ge d^{-\beta_1} g, \quad h - b \ge d^{-\beta_1} h \quad \text{and} \quad h' - b' \ge d^{-c} h' \tag{19}$$

where the constant $c > 0$ depends on both $\beta_1$ and $\beta_2$, while from (17) we have

$$|S| \le 2g, \quad |Q| \le 2h \quad \text{and} \quad |Q'| \le 2h' \tag{20}$$

for suitably large $d$.
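
One way to see (20) is sketched here, under the assumption that the first inequality of (17) reads $|S| \le |F| + 3(g - a)/\sqrt{d}$ (the constant 3 is the one that reappears in (21)). Since $F \subseteq N(E)$ and $|N(E)| = g$,

$$|S| \le |F| + \frac{3(g - a)}{\sqrt{d}} \le g + \frac{3g}{\sqrt{d}} \le 2g \quad \text{for } d \ge 9,$$

and the bounds on $|Q|$ and $|Q'|$ follow in the same way from $P \subseteq N(I)$ with $|N(I)| = h$ and $P' \subseteq N(J)$ with $|N(J)| = h'$.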

If $Q$ is tight then there are at most

$$\sum_{i \le c'_1 s/\log d} \binom{|Q|}{i} \le \sum_{i \le c'_1 s/\log d} \binom{2h}{i} \le 2^{s/2}$$

possibilities for $D$ (for sufficiently small choice of the constant $c'_1$, depending on $\beta_1$),
and in this case $|D| = h$. Here we are using (17) and (19). If $Q$ is slack there is just
one possibility for $D$, and in this case (using (17))

$$|D| \ge b + c'_1 s/\log d - 3s/\sqrt{d} \ge b + c'_1 s/(2\log d) \tag{21}$$

for suitably large $d$.
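
The bound $2^{s/2}$ on the number of possibilities for $D$ when $Q$ is tight can be checked as follows. This is a sketch rather than the argument of the source: it assumes the standard estimate $\sum_{i \le k} \binom{n}{i} \le (en/k)^k$ for $0 < k \le n$, and it takes $\log$ to be base 2 (another base changes only the constants). Writing $k = c'_1 s/\log d$ and using $h \le d^{\beta_1} s$ from (19),

$$\sum_{i \le k} \binom{2h}{i} \le \left(\frac{2eh}{k}\right)^{k} \le \left(\frac{2e\, d^{\beta_1} \log d}{c'_1}\right)^{c'_1 s/\log d} = \exp_2\left\{ c'_1 s \left( \beta_1 + \frac{\log(2e \log d / c'_1)}{\log d} \right) \right\}.$$

For fixed $c'_1$ the second term in the bracket tends to 0 as $d$ grows, so the exponent is at most $c'_1(\beta_1 + 1)s$ for suitably large $d$, and this is at most $s/2$ once $c'_1 \le 1/(2(\beta_1 + 1))$. The same computation with $2gd$ in place of $2h$ and $t$ in place of $s$ gives a bound of the form (22), after shrinking $c'_1$ if necessary.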

Similarly if $Q'$ is tight then there are at most $2^{s'/2}$ possibilities for $D'$ (for suitably
small $c'_2$ depending on both $\beta_1$ and $\beta_2$; here we use (19)), and in this case $|D'| = h'$;
while if $Q'$ is slack there is just one possibility for $D'$, and in this case $|D'| \ge b' +
c'_2 s'/(2\log d)$ for suitably large $d$.

If $S$ is slack then (17) implies $|N(E) \setminus F| < 2c'_1 t/\log d$ and since $|N(S) \setminus F| \le
d|S| \le 2dg$ (see (20)) the number of possibilities for $N(E) \setminus F$ is at most

$$\sum_{i \le 2c'_1 t/\log d} \binom{2gd}{i} \le 2^{t/2} \tag{22}$$

for suitably small $c'_1$ depending on $\beta_1$ (here again we use (19)).

Now we assume that $d$, $c'_1$ and $c'_2$ are suitably chosen so that all the previously made
observations hold. We bound the number of outputs of the procedure, considering
first the four cases determined by whether $S$ and $Q$ are slack or tight, and then
considering the two cases of whether $Q'$ is slack or tight. If $S$ and $Q$ are both tight
then the number of possibilities for $E$ is at most

$$\exp_2\{(s/2) + (g - c'_1 t/\log d - h)\} \le \exp_2\{g - c'_1 t/\log d - b - s/2\}. \tag{23}$$
(The first term in the exponent on the left-hand side corresponds to the choice of $D$
(using (21)), and the second to the choice of $E \setminus D \subseteq S \setminus D$ (note that since $S$ and
$Q$ are both tight, $|S \setminus D| \le g - c'_1 t/\log d - h$).)

If $S$ is tight and $Q$ is slack then the total is at most

$$\exp_2\{g - c'_1 t/\log d - b - c'_1 s/(2\log d)\}. \tag{24}$$

(Here there is no choice for $D$, and the exponent corresponds to the choice of $E \setminus D \subseteq
S \setminus D$ (using (21)).)

If $Q$ is tight then $|[E] \setminus D| = a - h$, so that if $S$ is slack (and $Q$ tight) then the
number of possibilities for $E$ is at most

$$\exp_2\{(s/2) + (t/2) + (a - h)\} \le \exp_2\{g - t/2 - b - s/2\}. \tag{25}$$

(The first term in the exponent on the left-hand side corresponds to the choice of $D$
(using (21)), the second to the choice of $N(E) \setminus F$ (using (22)) and the third to the
choice of $E \setminus D$.)

If $Q$ is slack then $|[E] \setminus D| \le a - b - c'_1 s/(2\log d)$ (see (21)), so that if $S$ and $Q$ are
both slack the number of possibilities for $E$ is at most

$$\exp_2\{(t/2) + (a - b - c'_1 s/(2\log d))\} \le \exp_2\{g - t/2 - b - c'_1 s/(2\log d)\}. \tag{26}$$

(The first term in the exponent on the left-hand side corresponds to the choice of
$N(E) \setminus F$ and the second to the choice of $E \setminus D$.)

Now we consider the number of choices for $O$, given our choice of $E$. Note that
$O \subseteq \mathcal{O} \setminus N(E)$, a set of size $M - g$. If $Q'$ is tight then the number of possibilities
for $O$ is at most

$$\exp_2\{(s'/2) + (M - g - h')\} \le \exp_2\{M - g - b' - s'/2\}. \tag{27}$$

(The first term in the exponent on the left-hand side corresponds to the choice of $D'$,
and the second to the choice of $O \setminus D' \subseteq \mathcal{O} \setminus (N(E) \cup D')$.)

Finally if $Q'$ is slack then the number of possibilities for $O$ is at most

$$\exp_2\{M - g - b' - c'_2 s'/(2\log d)\}. \tag{28}$$

(Here there is no choice for $D'$, and the exponent corresponds to the choice of $O \setminus D' \subseteq
\mathcal{O} \setminus (N(E) \cup D')$.)

Combining (23)–(26) with (27) and (28) we obtain the lemma. □
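
To make this last step explicit, here is one way the case bounds combine. The explicit constants are an assumption of this sketch (the source only asserts that suitable $c_1$ and $c_2$ exist), and it further assumes $c'_1, c'_2 \le 1$ and $\log d \ge 1$. Under these assumptions each of the four bounds on the number of choices for $E$ has exponent at most $g - b - c'_1(t + s)/(2\log d)$, and each of the two bounds on the number of choices for $O$ has exponent at most $M - g - b' - c'_2 s'/(2\log d)$. Multiplying the two counts gives at most

$$\exp_2\left\{ \left(g - b - \frac{c'_1 (t + s)}{2\log d}\right) + \left(M - g - b' - \frac{c'_2 s'}{2\log d}\right) \right\} = \exp_2\left\{ M - b - b' - \frac{c'_1 (g - a)}{2\log d} - \frac{c'_1 (h - b)}{2\log d} - \frac{c'_2 (h' - b')}{2\log d} \right\}$$

pairs $(E, O)$, which is the bound of Lemma 2.6 with $c_1 = c'_1/2$ and $c_2 = c'_2/2$.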